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' Abstract 

\ We present the first general bounds on the mixing time of the Markov chain associated to 

■ the logit dynamics for wide classes of strategic games. The logit dynamics with inverse noise 
\ (3 describes the behaviour of a complex system whose individual components act selfishly 
• and keep responding according to some partial (noisy) knowledge of the system, where the 

^| \ capacity of the agent to know the system and compute her best move is measured by the 

inverse of the parameter /3. 

In particular, we prove nearly tight bounds for potential games and games with dominant 
strategies. Our results show that, for potential games, the mixing time is upper and lower 
bounded by an exponential in the inverse of the noise and in the maximum potential dierence. 
Instead, for games with dominant strategies, the mixing time cannot grow arbitrarily with 
\ the inverse of the noise. 

■ Finally, we refine our analysis for a subclass of potential games called graphical coordina- 
O | tion games, a class of games that have been previously studied in Physics and, more recently, 

in Computer Science in the context of diffusion of new technologies. We give evidence that 
£NJ | the mixing time of the logit dynamics for these games strongly depends on the structure of 

the underlying graph. We prove that the mixing time of the logit dynamics for these games 
can be upper bounded by a function that is exponential in the cutwidth of the underlying 
graph and in the inverse of noise. Moreover, we consider two specific and popular network 
topologies, the clique and the ring. For games played on a clique we prove an almost match- 
ing lower bound on the mixing time of the logit dynamics that is exponential in the inverse 
of the noise and in the maximum potential dierence, while for games played on a ring we 
. prove that the time of convergence of the logit dynamics to its stationary distribution is 

significantly shorter. 
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1 Introduction 

Complex systems are often studied by looking at their dynamics and the equilibria induced by 
these dynamics. In this paper we concentrate on specific complex systems arising from strategic 
games. Here we have a set of selfish agents or players, each with a set of possible actions 
or strategies. An agent continuously evaluates her utility or payoff, that depends on her own 
strategy and on the strategies played by the other agents. A dynamics specifies the rule used by 
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the players to update their strategies. In its most general form an equilibrium is a distribution 
over the set of states that has the property of being invariant with respect to the dynamics. For 
example, a very well-studied dynamics for strategic games is the best response dynamics whose 
associated equilibria are the Nash equilibria. 

There are several characteristics of a dynamics and of the associated equilibrium concept 
that concur to make the dynamics descriptive of a system. First of all, it is desirable that the 
dynamics gives only one equilibrium state or, in case a system admits more than one equilibrium 
for a given dynamics, that the equilibria look similar. For example, this is not the case for Nash 
equilibria as a game can admit more than one Nash equilibrium and sometimes the equilibria 
have strikingly different characteristics. In addition, the dynamics must be descriptive of the 
way individual agents behave. For example, the best response dynamics is well-tailored for 
modeling players that have a complete knowledge of the global state of the system and of their 
payoffs. Finally, if a dynamics takes very long time to reach an equilibrium then the system 
spends most of its life outside of the equilibrium and thus knowledge gained from the study of 
the equilibrium is not very relevant. 

In this work we study a specific noisy best-response dynamics, the logit dynamics (defined in 
[6]) in which, at each time step, a player is randomly selected for strategy update and the update 
is performed with respect to a "noisy" knowledge of the game and of the state of the system, 
that is, the strategies currently played by the players. Intuitively, "high noise" represents the 
situation where players choose their strategies "nearly at random" because they have a limited 
knowledge of the system; instead, "low noise" represents the situation where players "almost 
surely" play the best response; that is, they pick the strategies yielding high payoff with "much 
higher" probability. After a sufficiently large number of steps, the probability that the system 
is found in a specific profile remains essentially unchanged and we say that the logit dynamics 
has converged to a stationary distribution, that is unique and independent of the starting state. 
We believe that this makes the logit dynamics the elective choice of a dynamics for large and 
complex systems in which agents have limited knowledge. However, one more step is needed 
to complete the picture. How long does the logit dynamics take to converge to the stationary 
distribution? This is the main technical focus of this paper. Specifically, we study the mixing 
time of the logit dynamics, that is, the time needed to get close to the stationary distribution. 
This depends on the underlying game and on the noise of the system (roughly speaking, the 
payoffs and how much players care about them). Since previous work has shown that the mixing 
time can vary a lot (from linear to exponential [3]) it is natural to ask the following questions: 
(1) How do the noise level and the structure of the game affect the mixing time? (2) Can the 
mixing time grow arbitrarily? 

In order to answer above questions, we give general bounds on the mixing time for wide 
classes of games. Specifically, we prove in Section [3] that, for all potential games, the mixing 
time of the logit dynamics is upper-bounded by a polynomial in the number of players and by 
an exponential in the rationality level and in some structural properties of the game. However, 
for very small values of f3 the mixing time is always polynomial in the number of players 

We complement the upper bound with a lower bound showing that there exist potential 
games with mixing time exponential in the rationality level. Thus the mixing time can grow 
indefinitely in potential games as (3 increases. In Section [5] we also study a special class of 
potential games, the graphical coordination games: we extend the result given in [5] for the 
Ising model; then, we give a more careful look at two extreme and well-studied cases, the clique 
and the ring. 

Going to the second question, in Section HJ we show that for games with dominant strategies 
(not necessarily potential games) the mixing time cannot exceed some absolute bound T which 
depends uniquely on the number of players n and on the number of strategies m. Though 
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T = T(n,m) is of the form 0(m n ), it is independent of the rationality level and we show that, 
in general, such an exponential growth is the best possible. 

Our results suggest that the structural properties of the game are important for the mixing 
time. For high (3, players tend to play best response and for those games that have more than 
one pure Nash equilibrium (PNE) with similar potential the system is likely to remain in a PNE 
for a long time, whereas the stationary distribution gives each PNE approximately the same 
weight. This happens for (certain) potential games, whence the exponential growth of mixing 
time with respect to the rationality level. On the contrary, for games with dominant strategies 
there is a PNE (a dominant profile) with high stationary probability and players are guaranteed 
to play that profile with non- vanishing probability (regardless of the rationality level). 

Related works. The logit dynamics was first studied by Blume [6J who showed that, for 2x2 
coordination games, the long-term behavior of the system is concentrated in the risk dominant 
equilibrium (see [TU] ) . The study of the mixing time of the logit dynamics for strategic games has 
been initiated in [3], where, among others, bounds were given for the class of 2 x 2 coordination 
games studied in [UJ. Before the work reported in 0J, the rate of convergence was studied only 
for the hitting time of specific profiles; see for example the work by Asadpour and Saberi [1] 
who studied the hitting time of the Nash equilibrium for a class of congestion games. 

Graphical coordination games are often used to model the spread of a new technology in 
a social network [18] with the strategy of maximum potential corresponding to adopting the 
new technology; players prefer to choose the same technology as their neighbors and the new 
technology is at least as preferable as the old one. Ellison [9] studied the logit dynamics 
for graphical coordination games on rings and showed that some large fraction of the players 
will eventually choose the strategy with maximum potential. Similar results were obtained by 
Peyton Young [T3] for the logit dynamics and for more general families of graphs. Montanari and 
Saberi [16] gave bounds on the hitting time of the highest potential equilibrium for the logit 
dynamics in terms of some graph theoretic properties of the underlying interaction network. 
We notice that none of [6j El [18] gave bounds on the convergence rate of the dynamics, while 
Montanari and Saberi [TB] studied the convergence time of a specific configuration, namely the 
hitting time of the highest potential equilibrium. 

Our work is also strictly related to the well-studied Glauber dynamics on the Ising model 
(see, for example, p3] and Chapter 15 of [12]). Indeed, the Ising model can be seen as a special 
graphical coordination game without risk dominant equilibria, and the Glauber dynamics on 
the Ising model is equivalent to the logit dynamics. In particular, we note that Berger et al. [5] 
relate the mixing time of the Ising model to the cutwidth of the underlying graph. Their results 
can be specialized to derive upper bounds on the mixing time of graphical coordination games 
without risk dominant equilibria. However the bounds we present in Section [5] are tighter. 

Even if the logit dynamics has attracted a lot of attention in different scientific communities, 
many other promising dynamics that deal with partial or noise-corrupted knowledge of the game 
have been proposed (see, for example, the recent work of Marden et al. [123 an d of Mertikopoulos 
and Moustakas [15] and references in [T9]). 

Paper organization. We give formal definitions of logit dynamics and some of the used 
techniques in Section [2j The upper bounds for potential games, for games with dominant 
strategies, and for graphical coordination games are given in Section O Section HI and Section 
respectively. 
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2 Preliminaries 



In this section we recall basic notions about strategic games and Markov chains, introduce the 
logit dynamics and describe the proof techniques that we will use for deriving upper and lower 
bounds on the mixing time. 

Games. In a strategic game Q we have a finite set of players {1, . . . , n}, and each player i has 
a finite set Si of strategies and a utility function m : S\ x • • • x S n — > R. A strategy profile of Q 
is a vector x = {x\, . . . , x n ) with X{ G Sf, that is, in profile x, player i chooses strategy Xi G Si. 
The utility (or payoff) of player i in profile x is ■Uj(x). Throughout the paper we adopt the 
standard game theoretic notation and write (a, x_j) for the profile obtained from x by replacing 
the i-th entry with a; i.e., (a, x_j) = (x±, . . . , a, Xj+i, . . . , x n ). We also denote with S the 
set 5 := Si x • • ■ x S n of all strategy profiles of the game. 

A strategic game Q is a potential game if there exists a potential function : S — > M. such 
that for every player i, every pair of strategies a,b G Si, and every profile x G <S, it holds that 

Ui(a, x_j) - ttj(6, x_j) = $(6, x_j) - $(a, x_j) . (1) 

Markov chains. A sequence of random variables (Xq, Xi, . . .) is a Markov chain Ai with 
sta£e space and transition matrix P if for all x,y G f2, all t ^ 1, and all events flf_ i = 
Pls=o{A^s = £ s } satisfying P (H t -\ D {A 4 = x}) > 0, we have 

P (X m = y | H t _ a U {X t = x}) = P (A m = y | X t = x) = P(x,y) . 

We denote with P x (•) and [•] the probability and the expectation conditioned on the starting 
state of the Markov chain being x, i.e., on the event {Xq = x}. The t-step transition matrix P t 
sets P t (x,y) = P x (X t = y). 

A Markov chain A4 is irreducible if for any two states x, y € O there exists an integer t = 
t{x, y) such that P t (x, y) > 0; i.e., it is possible to reach any state from any other one. The period 
of an irreducible Markov chain is the greatest common divisor of {t ^ 1 : 3x such that P (x, x) > 
0}. If the period of a Markov chain is greater than 1, then the chain is called periodic, otherwise 
aperiodic. If a Markov chain is finite (i.e., the space state Q is a finite set), irreducible and 
aperiodic then the chain is ergodic: for an ergodic chain there is an integer r such that, for all 
x,y € Q, P r (x,y) > 0. 

It is a classical result that if hA is ergodic there it converges to an unique stationary dis- 
tribution. That is, there exists a distribution ir on f2 such that ttP = n and, for every initial 
profile x € f2, the distribution P t {x,-) of the chain at time t converges to ir as t goes to infinity. 

A Markov chain A4 is reversible if for all x, y S £1, it holds that 

ir(x)P(x,y) = n(y)P(y,x) . 

The probability distribution Q{x, y) = tt{x)P{x, y) over f2xQ is sometimes called edge stationary 
distribution. 

The mixing time of a Markov chain is the time needed for P (x, •) to be close to ir for every 
initial state x. Specifically, 

tmix(e) : = min{t G N: \\P\x, •) - vr|| TV < e for all x G 17} , 

where ||P*(a;, •) — 7r | | tv = \ S^er2 l^i 00 ^ V) ~ n (y)\ ^ s ^ ne total variation distance among the 
probability distributions P t and tt. We will use the standard convention of setting t m ; x = 
imix(l/4) and observe that t m i x (e) ^ ^mix(l/4) ■ logl/e. 
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Logit dynamics. The logit dynamics (see [S]) for a strategic game Q runs as follows: at every 
time step a player i is selected uniformly at random and she updates her strategy to y € Si 
with probability <7j(y | x) defined as 

aAy | x) :=— l-e^fo*-*), (2) 

where x 6 S is the current strategy profile, Tj(x) = X^eS e^ Ui ^ z '^~ i ' is the normalizing factor, 
and parameter f3 ^ is the inverse noise (or inverse temperature). 

The logit dynamics for defines a Markov chain TW^(ty) = {Xt : t € N} with state space 
5 = Si x • • • x S n and transition probabilities 



P(x,y) = - 

72, 



CTj(yi I x), if x / y and x„j = y_i; 

EiLi^fi I x )' if x = y; ( 3 ) 

k 0, otherwise. 



We will find convenient to identify the logit dynamics for Q with the Markov chain M^{Q). It 
is easy to see that M^(G) is ergodic. Therefore, there exists a unique stationary distribution it 
and, for every initial profile x, the distribution P*(x, •) of the chain at time t converges to tt as 
t goes to infinity. 

It is easy to see that, if Q is a potential game with potential function <3>, then M^(Q) is 
reversible and the stationary distribution is the Gibbs measure 

tt(x) = i^*W (4) 

where Z = ^2 ye g e^*( y ) is the normalizing constant (also called the partition function). We will 
write Z/3 and ix^ when we will need to stress the dependence on the inverse noise /?. 



Further notation. We use bold symbols for vectors. We denote by d(x, y) the Ramming 
distance between x, y, that is the number of coordinates in which these two vectors differ. 
Given a set A of vectors, the Hamming graph on A is the graph with vertex set A and an edge 
between x and y if and only if d(x, y) = 1. For each edge e = (x, y) of the Hamming graph we 
say that e goes along the dimension i of the Hamming graph if x and y differ on the position i 
and denote its dimension by A(e). 



2.1 Proof techniques 

In order to derive our bounds on the mixing time of the logit dynamics we will use the following 
well-established techniques: Markov chain coupling and spectral methods for the upper bounds 
and the bottleneck ratio theorem for the lower bounds. In the rest of this section we will state 
the the theorems we will use for obtaining our bounds. For a more detailed description we refer 
the reader to [T2"] . 



Markov chain coupling. A coupling of two probability distributions \i and v on Q is a pair 
of random variables (X, Y) defined on £1 x such that the marginal distribution of X is \i and 
the marginal distribution of Y is v. A coupling of a Markov chain A4 with transition matrix P 
is a process (Xt,Yt)^2. with the property that both Xt and Yj are Markov chains with transition 
matrix P. When the two coupled chains start at (Xq,Yq) = (x,y), we write P x ,y (•) and E x>y [■] 
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for the probability and the expectation on the space where the two coupled chains are both 
defined. We denote by r coup i e the first time the two chains meet; that is, 

Tcouplc = min{t : X t = Y t }. 

We will consider only couplings of Markov chains with the property that for s ^ T coup i c , it holds 
that X s = Y s . The following theorem establishes that the total variation distance between 
P t (x, •) and P (y, ■), the distributions of the chain at time t starting at x and y respectively, 
is upper bounded by the probability that two coupled chains starting at x and y have not yet 
meet at time t (see, for example, Theorem 5.2 in |12j). 

Theorem 2.1 (Coupling). Let M be a Markov chain with finite state space £1 and transition 
matrix P. For each pair of states x,y £ f2 consider a coupling (Xt, If) of M with starting states 
Xq = x and Yq = y. Then 

\\P t (x r )-P t (y r )\\ T y *S ~Px,y (^couple > t) . 

Since for an ergodic chain with transition matrix P and stationary distribution tt, for every 
x € fl it holds that ||P*(;e, •) — vr|| TV ^ maxyg^ ||P*(x, •) — P t (y, OIItv ^ e a ' 30ve theorem gives 
a useful tool for upper bounding the mixing time. 

Sometimes it is difficult to specify a coupling and to analyze the coupling time r coup i c for 
every pair of starting states (x,y). The path coupling theorem says that it is sufficient to define 
a connected graph over the Markov chain state space and consider only couplings of pairs of 
Markov chains starting from adjacent states. An upper bound on the mixing time can then 
be obtained if each one of those couplings contracts its distance on average. More precisely, 
consider a Markov chain A4 with state space and transition matrix P; let G = (0, E) be 
a connected graph and let w : E — > M. be a function assigning weights to the edges such that 
w(e) ^ 1 for every edge e££; for every 1,1/6 0, we denote by p(x, y) the weight of the shortest 
path in G between x and y. The following theorem holds. 

Theorem 2.2 (Path Coupling [7]). Suppose that for every edge (x,y) £ E a coupling (X t ,Y t ) 
of M. with Xq = x and Yq = y exists such that E x ,y [p(Xi, Yi)] ^ e~ a ■ w({x, y}) for some a > 0. 
Then 

log(diam(G9)+log(l/ £ ) 

Z-mixV^J ^ 

a 

where diam(G) is the (weighted) diameter of G. 



Spectral methods. Let P be the transition matrix of a Markov chain with finite state space 
Q and let us label the eigenvalues of P in non-increasing order 

Ai ^ A 2 ^ • • • > A|q|. 

It is well-known (see, for example, Lemma 12.1 in [12] ) that Ai = 1 and, if P is ergodic then 
A2 < 1 and Aiqi > —1. We denote by A* the largest absolute value among eigenvalues other 
than Ai. The relaxation time t rc \ of an ergodic Markov chain M is defined as 

1 

t rp ] = — = max 



1-A* 1 1 - A 2 ' 1 + A|o, 

The relaxation time is related to the mixing time by the following theorem (see, for example, 
Theorems 12.3 and 12.4 in [12]). 
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Theorem 2.3 (Relaxation time). Let P be the transition matrix of a reversible ergodic Markov 
chain with state space f2 and stationary distribution tt. Then 

(* re l - 1) • log ( ) < tmix(e) < *rel ' log ' 



2e J \ en- 



where 7r min = min^n vr(x). 



The following theorem shows how to use the coupling technique to obtain an upper bound 
on the relaxation time as well (see Theorem 13.1 in |12j). 

Theorem 2.4 ([5]). Let P the transition matrix of a Markov chain Ai with state space £1 and 
let p be a metrics on Q. Suppose there exists a constant 8 < 1 such that for each x,y 6 Q, there 
exists a coupling (X, Y) ofP(x,-) and P(y,-) such that 

E x , y [p(X,Y)]^6-p(x,y). 

Then the relaxation time of M is t rc \ ^ . 

Consider two ergodic reversible Markov chains Ai and Ai over the same state set O. We 
denote by tt and tt and by Q and Q the respective stationary distributions and edge stationary 
distributions. We define the set of the edges of Ai as the set of pairs x,y € Q such that 
P(x, y) > 0. For x,y € Q, an Ai-path is a sequence F x>y = (ei, . . . , e&) of edges of Ai such that 
ei = (xi-i, Xi), for i = 1, . . . , m, and xq = x and x m = y. The length k of path T XjV is denoted 
by (r^^j. Let T be a set of paths T = {T XtV }, one for each edge (x,y) of Ai. We define the 
congestion ratio a of T as 

/ , \ 



a = max 

e&E 



\ eer B)1) / 



where is the set of edges of .M. 

The following theorem relates A2 and A2 , the second eigenvalues of of Ai and Ai , respectively. 

Theorem 2.5 (Path Comparison Theorem). Let Ai and Ai be two ergodic reversible Markov 
chains over the same state space SI and let A2 and A2 be their respective second eigenvalues. If 
there exists a set V of Ai -paths, containing one path for each edge of At, with congestion ratio 
a, then 

1 1 



1 - A 2 1 - A 2 

where 7 = max^n tt(x)/tt(x). 

The following theorem is a special case of the Path Comparison Theorem obtained by consid- 
ering a Markov chain Ai with stationary distribution tt and a Markov chain Ai with transition 
probability P(x,y) = ir(y). 

Theorem 2.6 (Canonical paths |llj). Let Ai be a reversible ergodic Markov chain over state 
space O with transition matrix P and stationary distribution tt. For each pair of profiles x,y € 0, 
let r x y be an Ai-path. The congestion p of the set of paths is defined as 

( \ 



max 



7r(a;)7r(y)[r X); 



1 > x,y: 1 



Then it holds that 1 _ 1 Aa ^ p. 
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Bottleneck ratio. Let A4 = {Xt : t € N} be an ergodic Markov chain with finite state space 
Q, transition matrix P, and stationary distribution tt. For a set of states R C fi, the bottleneck 
ratio at R is defined as 

7t(R) 

where Q(R,R) = J2 X £R yen Q( x -> V)- The following theorem states that, for every R with 
ir(R) ^ 1/2, the mixing time is larger than the reciprocal of the bottleneck ratio at R, up to a 
constant factor (see, for example, Theorem 7.3 in 



Theorem 2.7 (Bottleneck ratio). Let Ai = {Xt : t E N} be an ergodic Markov chain with 
finite state space £1, transition matrix P, and stationary distribution tt. Let R C Q be any set 
with tt(R) ^1/2. Then the mixing time is 

1 -2e 

trails) > ^r^y . 



3 Potential games 

In this section we give bounds on the mixing time of the Markov chains of the logit dynamics 
with inverse noise (3 of potential games. 

We start by proving a property of the eigenvalues of the transition matrix of the Markov 
chain of the logit dynamics with inverse noise /3 of a potential game. This result, that we think 
to be interesting by itself, shows that the relaxation time only depends on the second eigenvalue 
of the transition matrix. We then give bounds for three different ranges of /3. Our first bound 
holds for all values of /3. Then we show a slightly better bound for low values of f3. Finally, we 
give a more precise bound on the mixing time for high values of (3. 



3.1 Eigenvalues of the Logit Dynamics 

We will show that the second eigenvalue of the transition matrix of the Markov chain of the 
logit dynamics with inverse noise /3 for a potential game is always larger in absolute value than 
the last eigenvalue. Hence, for these games, t m \ = . 

Theorem 3.1. Let Q be an n-player potential game with profile space S and let P be the 
transition matrix of the Markov chain of the logit dynamics with inverse noise f3 for Q . Let 
1 = Ai A2 ^ . . . ^ A151 be the eigenvalues of P. Then A2 A151 . 

Proof. To prove the theorem we will show that all eigenvalues are non-negative which implies 
that A|5| ^ 0. Then, we have that |A|5|| = A151 ^ A2. 

Assume for sake of contradiction that there exists an eigenvalue A < of P and let / be an 
eigenfunction of A. By definition, / 7^ 0. Since A < 0, then for every profile xES such that 
/(x) / 0, we have sign ((P/)(x)) = sign (A/(x)) / sign (/(x)) and thus 

(Pf,f)n :=^7r(x)(P/)(x)/(x)<0, 

where tt is the stationary distribution of the Markov chain of the logit dynamics with inverse 
noise /3 for Q. 

For every player i and for every strategy sub-profile z_j, we consider the single-player matrix 
p0,z_i) d e f] nec i as 

P (i,^) fx vl . = 1 / ^ Ui{y \ ^ = y-i = ; 

K ,yj ' Ti(z^) I 0, otherwise. 
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The transition matrix P is the sum of all such "single-player" matrices: 

p = 1 V V p (i ' z - i) 

Let us define 

S't.z-j := {x | x = (si,z_») and s* G 5f} . 
For any x, y € <S'i jZ _ i we have that 

P /3«i(y) „,,. , ,. ,. P /3«i( x ) P /3«i(y) 

e -/J(*(y)-*(x)) wMch implieg 



e /3«i(x) ° ^ e -/3*(x) e -/3*(y) " 

Thus, the ratio ri ;Z _ i := is constant over all z £ Si tZ _ i . Hence, whenever P^ ,z -^(x,y) is 

not zero, it does not depend on x: indeed, 

Setting Cj jZj = r ^' z ~^' , we obtain that 



( p(i^) /)/)7r = Ci ^ £ £ ,(xWy)/(x)/(y) = C !iZ J £ tt(x)/(x) £ . 
From the linearity of the inner product, it follows that 

(p/,/) 7r = -V^(p^-*)/,/)^o ; 

contradicting the hypothesis. □ 



3.2 Bounds for all p 

In this section, we give an upper bound on the mixing time of the Markov chain of the logit 
dynamics with inverse noise (3 for potential games. The upper bound is expressed in terms of 
the maximum global variation A<3? of the potential function <I>: 

A$ := max{$(x) - $(y)} = $ max - $ min 

where $ max and <I? m i n are the maximum and the minimum value of <J>, respectively. The upper 
bound holds for every value of the inverse noise /3. We shall also provide examples of games 
whose logit dynamics has a mixing time close to the given upper bound. 



The upper bound. Let Q be a strategic game with profile space 5 and let M@ be the Markov 
chain of the logit dynamics for Q with inverse noise /3. By Theorem 12.31 to get an upper bound 
on the mixing time of it suffices to give an upper bound on the relaxation time t^ el of A4^ . 
We obtain an upper bound on the relaxation time of by comparing it with A4° (the Markov 
chain of the logit dynamics with inverse noise j3 = 0). A4° is a random walk on a regular graph 
with \S\ vertices and its stationary distribution 7To is the uniform distribution ttq(x) = 1/|<S|. 
An upper bound on its relaxation time is given by the following lemma. 
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Lemma 3.2. Let Q be an n-player game. The relaxation time of the Markov chain A4° of the 
logit dynamics for Q with inverse noise /3 = is tjLj n. 

Proof. Let P be the transition matrix of M°. For any two profiles x,y of Q we define the 
following coupling (X, Y) of distributions P(x, •) and P(y, •): pick i E [n] and s € Si uniformly 
at random and update X and Y by setting Xi = s and Yi = s. Notice that when the coupling 
picks a player j on which x and y differ, the Hamming distance d(X, Y) of X and Y decreases 
by one; otherwise, it remains the same. Thus, 

E x , y [d(X, Y)\ = ^0 ^ (X) y) _ i) + ^ _ ^iTj d(Xj y) = (l - I) «f( x> y) . 

The lemma follows by applying Theorem 12.41 with 9 = (l — □ 

The following lemma is the main technical result of this section. 

Lemma 3.3. Let Q be a n-player potential game where each player has at most m strategies. 
Let A<3? be the maximum global variation of the potential <& of Q. Then the relaxation time of 
the Markov chain A4@ of the logit dynamics for Q with inverse noise (3 is 

^ < 2mn • e" A *. 

Proof. Fix j3 > and denote by 7Tq and irp the stationary distributions of M° and M.^ . For all 
x € S, vro(x) = jqt and 

e -/8*(x) g-ZJ^min 
7T/3(x) = < , 

where Zp = ^2 ye s e _ ^*( y ) is the partition function of A4 13 . Therefore, for all x, 

^M<7 With 7 =M. e -/31> mi n. 

7T (x) Zp 

Next we define an assignment of A'l^-paths to the edges of The edges (u, v) of Ai° (and 
of Ai 13 ) consist of two profiles u and v that differ only in the strategy of one player. With each 
edge of A4°, we associate a path consisting of at most two admissible edges of M? . We say 
that edge (u, z) of is an admissible edge if one of u and z minimizes the potential function 
$ over all profiles that are in the intersection of the neighborhoods of u and z. Notice that, 
since u and z differ exactly in one component, say the j-th, u_j = z_j and the intersection of 
the two neighborhoods consists of all profiles of the form (u_,, s) with s € Sj. 

If edge (u, v) of M° is admissible then it is associated with the path r U)V consisting solely of 
the edge (u, v) of MP . Otherwise, let z be the profile of minimum potential in the interesection 
of the neighborhoods of u and v. Path r U;V of edge (u, v) of A4° then consists of the edges 
(u, z) and (z,v) of M 13 . 

Let us now upper bound the congestion ratio a of the assignment. By the reversibility of Mr , 
we have that Qp(u,z) = Qp(z, u) and therefore, without loss of generality, we compute Qp(&) 
for e = (u, z) for which z is the minimum potential profile from the common neighborhood of 
u and z. For such an edge e, by denoting by i the one component in which u and z differ, we 

e -/3*(u) e -/3*(*0 e -/3* max i 

Qls{e) = Zf, n • £ aeSl e -W-*.-> > Z P n . Ea6gi ^(*W-*(.-..))- 
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Observe that 3>(z) ^ $(z_j, s) for all s E Si and thus each of the \Si\ terms in the sum at the 
denominator is at most 1. Hence 

g-jS*max I 



Zp n-\Si\ 



Now, edge e = (u, z) is used on all paths for edges (u, v) of M° for which u and v differ only 
in the i-th component. Thus we have Qo(u,v) = 1/\S\ ■ l/(n ■ \Si\). Finally, since there are 
exactly such edges of Ai° and their corresponding paths in Mr have length at most 2, we 
can write the congestion ratio for edge e = (u, z) of M,^ as 



eer u , v eGr Uiy 
By applying the Path Comparison Theorem (Theorem 12 .5p with 

a = . e /34w and 7 = M. e -P^ , 

1^1 ^8 

and since, by Theorem 13.1 1 = 1/(1 — Ami), we obtain that 

The lemma then follows from Lemma 13.21 □ 

We now give an upper bound on the mixing time of M 13 , for all @ > 0. 

Theorem 3.4. Let G be a n-player potential game where each player has at most m strategies. 
Let A$ be the maximum global variation of the potential $ of Q. Then the mixing time i mix of 
the Markov chain A4^ of the logit dynamics for Q with inverse noise (3 is 

£ mix (£) ^ 2mn ■ e^ A * ^log i + /3A$ + n log m\ . 

Proof. Use Theorem 12.31 the upper bound on the relaxation time provided by Lemma 13.31 and 
the fact that, for all x, vr^x) > 1/ (eP A *\S\). □ 

Remark. Notice that we can write the upper bound provided by Theorem 13.41 as 



''mix e 

where the asymptotic is in j3. 



The lower bound. In this section we provide a lower bound on the mixing time of the Markov 
chain of the logit dynamics with inverse noise (5 for potential games. We start by observing 
that it is easy to find a potential <3? for which the Markov chain of the logit dynamics with 
inverse noise f$ has mixing time f](e^ A *). Specifically, consider a potential $ on {0, l} n such 
that <&(0) = $(1) = and, for some L > 0, $(x) = L for all x / 0, 1. Then a bottleneck 
argument shows that the mixing time is Q(e^ L ). However, for this potential $ the maximum 
global variation A$ coincides with the maximum local variation 

8<f> = max{$(x) - <£(y) : (i(x, y) = 1} 

and thus it could be compatible with the fact that the mixing time depends on the maximum 
local variation and not on A<I>. The next theorem proves that the term e^ A * cannot be improved 
for values of f3 sufficiently large. 
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Theorem 3.5. For every sequences {g n } end {l n } such that 2 ■ g n /n ^ l n ^ g n , there exists 
a sequence of potentials {§ n } such that has maximum global variation z\$ n = g n and 
maximum local variation 5<& n = l n and the mixing time i mix of the Markov chain of the logit 
dynamics with inverse noise (5 fov the game with potential is 

JS > flA* n (l-o(l)) 
''mix e 

Proof. Fix n and set c = g n /l n - Consider potential <I> n on {0, l} n defined as 

$ n (x) = -l n ■ min{c, |c - w(x)\} , 

where w(x) denotes the number of l's in x. Notice that <53> n = l n , the minimum of $ n is 
3> n (0) = —cl n = —gn, while the maximum is and is attained by the states in the set M = 
{x £ {0, l} n : io(x) = c}. Therefore A<3? n = g n . Moreover, we have that if \w(x) — c\ = \w(y) — c\ 
then <& n (x) = <3? n (y); that is, $ n is symmetrical with respect to c (note that c ^ n/2, by 
hypothesis). 

Consider the Hamming graph on S \ M: it is not hard to see that this graph has two 
connected components. Let us name R the component that contains 0, i.e, 

R = {x 6 {0,l} n : io(x) < c}. 

By the symmetry of $ n , we have that ir(R) ^ ^ and thus we can apply the Bottleneck Ratio 
Theorem (Theorem 12. 7p to R. Let us now provide a bound on B(R). We start by observing 
that for every pair x, y of profiles that differ in player i, for some i € [n], it holds that 

e -0*(x) i e -/8*(y) e -/9*(y) 

Q(*,y) = — ^ ^ _3 $rx . < — ■ (5) 

We define dR as the set of profiles in R that have a neighbor in M, i.e., 

OR = {x € {0, l} n : u>(x) = c - 1} . 

Note that for every x € dR there are at most (m — l)n neighbors outside i? and all of them 
belong to M by definition, thus 

p -/3$(y) i 

= E E«(^)< E E^^K 1 )!^!. 

x&dRyGM xedRyeM 



where the last inequality follows from 3>(y) = for every y € M. Obviously, we have 

tt(R) > vr(O) = 

The last two inequalities yield 
Finally, observe that 

d ^e clogn = e **« . 



Thus, by Theorem 12.71 we have that the mixing time of Markov chain of the logit dynamics 
with inverse noise /3 for the game with, potential is 

fl 3 fVl > l ~ 2e . p/ 8A* n -(A* n /<5*n)logn _ /9A*(l-o(l)) 
tmix^J ^ 2 (m - 1) 6 " 6 

as /3 can be chosen large enough so that logn/<5<£ n = o(l). □ 
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3.3 An upper bound for small (3 

The upper bound provided by Theorem 13.41 is very weak as it gives a quadratic upper bound 
even for small /3. On the other hand, by Lemma 13.21 the mixing time of A4° is O(nlogn). In 
this section we will show that this bound holds for small values of (3 and not only for (3 = 0. 

Theorem 3.6. Let Q be an n-player potential game with maximum local variation 5&. // 
f3 ^ c/(n • 5&), for some constant c < 1, then the mixing time of the Markov chain of the 
logit dynamics for Q with inverse noise (3 is i^ ix = O(nlogn). 

Proof. Let P be the transition matrix of A4@ . For every pair of profiles x and y at Hamming 
distance (i(x,y) = 1, we describe a coupling (X, Y) of the distributions -P(x, ■) and P(y, •). 

For each player i, we partition two copies, Ix,i and Iy,%, of the interval [0, 1] in sub-intervals 
each labeled with a strategy from the set Si = {z±, . . . , z\s t \} of strategies of player i. For each 
U € [0, 1], we define the label of U with respect to X and i to be the label of the sub-interval of 
Ix,i that contains U. Similarly, we define the label of U with respect to Y and i. The coupling 
picks i £ [n] and U € [0, 1] uniformly at random. X and Y are then updated by setting the 
strategy of the i-th player equal to the label of U with respect to X and i and, similarly, Y is 
updated by setting the strategy of the i-th player equal to the label of U with respect to Y and 
i. 

Let us now describe how the sub-intervals are constructed. At the beginning Ix,% and Iy% are 
each considered as one unlabeled interval. As the construction of the sub-intervals progresses, 
we cut pieces from the unlabeled intervals and label them with a strategy. Specifically, the 
partitioning of Ix,i and Iy,i has one phase for each z 6 Si during which the leftmost portion of 
length / = min{<7j( | y)} of the unlabeled interval of both Ix % and Iy,i is labeled with 

z. In addition, the rightmost portion of length o~i{z | x) — I of the unlabeled interval of Ix,i and 
the rightmost portion of length Oi{z | y) — I of the unlabeled interval of Iy,i are both labeled 
with z. Notice that, for each z, at least one of these last two sub- intervals has length 0. When 
all z E S have been considered, all points of the two intervals belong to a labeled sub-interval. 
We have that (X, Y) is a coupling of P(x, •) and P(y, ■). Indeed, by construction the size of the 
sub-intervals labeled z in Ix,i is exactly a^z | x) and similarly for Iyj. 

Moreover, there exists li € (0, 1) such that all U ^ li have the same label with respect to 
X and Y; whereas, for all U > li the label with respect to X and the label with respect to Y 
differ. 

Let us now bound the expected distance d(X, Y) for profiles x and y differing in the j-th. 
player. We have three possible cases. 

• i = j- 

In this case, which has probability 1/n, we have that <Jj{z \ x) = aAz \ y) for all z £ Sj 
and thus £j = 1. As a consequence, for any U the same strategy is used to update the 
j-th player and we have that X = Y and thus d(X, Y) = 0. 

• i 7^ j and U ^ li. 

This case has probability ^ Si^j an d i n this case U has the same label with respect to 
X and Y. Therefore, the i-th player strategy is update in the same way, thus keeping the 
distance between X and Y equal to 1. 

• i j and U > li. 

This case has probability ^ — ^i) and in this case U has different labels with respect 

to X and Y. Therefore, the i-th player strategy is update differently in X and Y thus 
bringing the distance between X and Y up to 2. 
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We have that the expected distance between X and Y is 

E x , y [d(X, 7)] = -V (£i + 2(1 - £$) 

<(i-|)p-0 

where £ = min^ . 

Let us now give a lower bound on t. From the coupling construction, we have li = 
J2 z eSi m ^ n Wi(z | x),cri(z I y)}. Observe that for any profile x, any player i and any strat- 
egy z € Si it holds that 

e -0*(*-*.«) _ 1 1 

^ I x ) - £ e -^*(x-i,fc) ~ £ fc e ^[*(x_ ilZ )-$(x_ i ,fc)] ■ 



Hence 



4 = *>M*i(* | x), ^(z | y)} > £ — * = e "/"* > e~ c ' n 
zeSi zeSi 1 



where in the last inequality we used the fact that (3 < c/(n- 5$). Thus, the expected distance 
between X and Y is upper bounded by 

E x , y [d(X, Y)] < e" 1/n (2 -i)^ e" 1/n (2 - e" c/n ) 

= e- 1/n (l + (1 - e" c/n )) < e- 1/n (l + c/n) < e" 1/n e c/n = e~^r , 

where in the second line we repeatedly used the well-known inequality 1 + x ^ e x for every 
x > — 1. 

The thesis then follows by applying Theorem 12.21 with a = □ 
3.4 Bounds for large f3 

In this section we give another upper bound on the mixing time of the Markov chain of the 
logit dynamics with inverse noise (3 for potential games. The bound depends on a structural 
property of the potential function that measures the difficulty of visiting a profile starting from 
a profile with higher potential value. We will also show that the upper bound is tight for large 
values of j3. 

Let $ be an n-player potential defined over the set S of profiles of the game. We consider 
paths 7 over the Hamming graph of 5. More precisely, a path 7 is a sequence (xo,xi, . . . , x^) 
of profiles such that x$_i and Xj differ in the strategy of exactly one player, for i = 1, . . . , k. 
We denote by V^y the set of all paths from x to y. For a path 7 = (xo,xi, . . . , x^) with 
^(xo) J? <&(xfc), we denote by £(7) the maximum increase of the potential function along the 
path; that is, 



and set 



C(t) = max ($(xi) - $(x )) , 



C(x,y) = min £(7) 



Finally, we set C, = max Xi y £(x, y). Notice that £ ^ 0. 

Lemma 3.7. XTie relaxation time of the Markov chain of the logit dynamics with inverse noise 
j3 for a potential game with potential $ and at most m strategies for player satisfies 

4 < nm 2n+1 e^ . 
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Proof. Recall that an edge (u, z) of MP is admissible if one of u and z minimizes the potential 
in the intersection of the neighborhoods of u and z. To bound the relaxation time, we construct, 
for each pair (x, y) of profiles of a path 7 x>y in M.^ such that C(7x,y) = C( x >y) an d each 
edge of 7 X)Y is admissible. Every pair (x, y) of profiles has such a path. Indeed, let 7 be a 
path from x to y with £(7) = £(x, y) and let (u, v) be a non-admissible edge of 7. Then we 
replace the edge (u, v) with the two admissible edges (u, z) and (z,v), where z is the profile 
that minimizes the potential over the profiles in the intersection of the neighborhoods of u and 
v. Notice that the resulting path 7' has one fewer non-admissible edge and C(t') = C(t)- 

We next lower bound Q(u, z) for an an admissible edge (u, z). Notice that since the chain 
is reversible we have Q(u, z) = Q(z,u) and thus, without loss of generality, we can compute Q 
only for admissible edges (u, z) in which z has the minimum potential from among the common 
neighbor profiles of u and z. For such an admissible edge (u, z), by denoting by j the player at 
which u and z differ, we have 

1 e -/9*W 

Q(u, z) = 7r(u)P(u, z) = 7r(u) 



1 1 > 7r(u) 

where the last inequality follows from the fact that profile z has the minimum of the potential 
over profiles (u_j, s), with s E Sj, and thus all the addends of the sum are at most 1. 
Then, for every admissible edge (u, z), we obtain 

£ • |7x, y | < £ vr(y)e^H"*W) 

(x,y): ^ U ' Z J (x>y): 
(u,z)G7 x , y (u,z)G7 x , y 



^nm n+1 -e K £ vr(y) ( 6 ) 



(x,y): 

(U,z)e7x,y 

^ nm 2n+l e^ , 

where we used the fact that |7x,y| ^ m n and that admissible edge (u, z) can be used by at most 
m n paths. The lemma then follows from Lemma 12.61 and Theorem 13.11 □ 

Notice that, if ( > 0, then tf cl < e ^C(l+°(i)) (the asymptotic is on f3). By using this bound on 
relaxation time and the fact that 7r m i n ^ l/(e^ A *|5|), Theorem Ogives the following bound 
on the mixing time. 

Theorem 3.8. The mixing time of the Markov chain of the logit dynamics with inverse noise 
f3 for a potential game with potential £ > 0, and at most m strategies for player satisfies 

ffi < p0C(i+°(i)) 

''mix e 

The next theorem shows that this bound is almost tight. 

Theorem 3.9. The mixing time of the Markov chain of the logit dynamics with inverse noise 
j3 for a potential game with potential £ > 0, and at most m strategies for player satisfies 



tr > e 

''mix ^ L 



/3C(l-o(l)) 
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Proof. Let (x, y) be a pair such that C = C( x >y)j let 7x,y be a path such that C(7x,y) = C( x jy) 
and let z be the profile of maximum potential along 7x,y Therefore, £ = $(z) — $(x). Now 
consider the set of all paths from x to y and let M be the set of profiles that have maximum 
potential along at least one path of "Px.y Since £ > 0, then M ^ S and M 7^ 0. By removing 
all vertices in M, we disconnect x from y in the Hamming graph and we let -R X) m (respectively, 
Ry,M) be the component that contains x (respectively, y). 

Obviously, at least one of -R X ,A/ and Ry,M has total weight less than 1/2 in the stationary 
distribution n. We next show that there exists one of i? X) A/ and R Yt M, call it R, such that 
ir(R) < 1/2 and R contains a profile with potential ^(x). This is obvious when $(x) = ^(y). 
Since (x, y) attains ( it must be the case that ^(x) $(y) and thus we only need to consider 
the case $(x) > ^(y). We claim that in this case x has the minimum potential in R^m- 
Indeed, suppose 3>(x) > 3>(w) for some w G -R Xj m and consider the path 7 w>y from w to y 
such that C(7w,y) = C( w >y)- Observe that both endpoints of 7 W)Y have potential strictly lower 
than x. Moreover, since M disconnect w and y, 7 Z) y has to go through a profile of M with 
potential at least 3>(z). Then C(7z,y) > ^( z ) ~~ ^K x ) = C and this is a contradiction. Therefore, 
if <£(x) > $(y), then <&(w) > $(y) for each w € -R x ,a/ and hence 7r(y) > vr(i? XiA ./), for /3 
sufficiently large. Thus, if <3?(x) > $(y), then ir(R Xt M) ^ 1/2 for /3 sufficiently large. 

Next we provide a bound on B(R). Denote by dR the set of profiles in R that have a 
neighbor outside R (and hence in M) and note that each profile u G dR has at most n(m — 1) 
neighbors in M. We have 

„-/3$(v) 

where the first inequality follows from ([5]) and last inequality follows from 3>(v) > 3>(z) for 
every v G M. Obviously, we have 

e -/3*(x) 

ir(R) ^ vr(x) = 



Z 



The last two inequalities yield 



B (R) = Q ( R ^) <: ! m _ i) \ dR \ . e ~K . 
tt{R) 

Thus, by Theorem 12.71 we have that the mixing time of Markov chain of the logit dynamics 
with inverse noise /3 for the gctniG witli potential is 

iP (e) = l ~ 2e 

mixV ' 2(m - 1) \dR\ 

and the theorem follows by taking f3 sufficiently large. □ 



4 Mixing time independent of (3 

Theorems 13.91 and 13.51 show that there are games where the mixing time grows with j3. In this 
section we show that if Q has a dominant profile then the mixing time of its logit dynamics is 
upper bounded by a constant independent of (3. We say that strategy s G Si is a dominant 
strategy for player i if for all s' G Si and all strategy profiles x G S, 

Ui(s,X-i) ^ Uj(s',X_j) . 
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A dominant profile (si, . . . ,s n ) is a profile in which Sj is a dominat strategy for player i = 
1,... ,n. 

Let Q be a game with a dominant profile. We assume without of loss of generality that all 
players have the same dominant strategy and let us name it 0. Thus profile = (0, . . . , 0) is 
the dominant profile. It is easy to see that the following holds for the logit dynamics of Q. 

Observation 4.1. Let Q be an n-player game with dominant strategies. Then for every profile 
x, for every {3 ^ and for every i = 1, • • • , n, we have <7j(0 | x) ^ l/|<Si|- 

We are now ready to derive an upper bound on the mixing time of the Markov chain of the 
logit dynamics for games with dominant strategies. 

Theorem 4.2. Let Q be an n-player games with dominant profile where each player has at most 
m strategies. The mixing time of M^{Q) is t^, = O (m n n\ogn). 

Proof. We apply the coupling technique (see Theorem 12. ip and use the same coupling used in 
the proof of Theorem 13.61 The following properties of that coupling are easily verified. 

1. The coupling always selects the same player in both chains. 

2. If player i is selected for update, the probability that both chains choose strategy s for 
player i is exactly min{<7j(s | x),<Tj(s | y)}. Since, by Observation 14.11 we have that 
<7j(0 | x),<7j(0 | y) ^ l/\Si\, then the probability that the coupling updates the strategy 
of player i in both chains to is at least l/|Si|- 

3. Once the two chains coalesce, they stay together. 

Let r be the first time such that all the players have been selected at least once and let t* = 
2nlogn. Note that, by a Coupon Collector's argument and by Markov's inequality, for all 
starting profiles z and w of the coupled chains, 

P z ,w (r ^ t*) ^ \ . (7) 
We next observe that for all starting profiles z and w, it holds that 

P z ,w {Xv = and Y t * = | r < t*) ^ — . (8) 

Indeed, given that all players have been selected at least once within time t*, both chains are in 
profile at time t* if and only if every player chose strategy in both chains the last time she 
played before time t*. Prom the properties of the coupling and by Observation 14.11 it follows 
that this event occurs with probability at least l/m n . Hence, for all starting profiles z and w, 
we have that 

P z ,w (X t * =Yt*)> P z ,w {X t * = and Y t * = 0) 

^ P z , w (AV = and Y t , = | r < t*) P z , w (r < t*) (g) 

1 1 

> , 

m n 2 ' 

where in the last inequality we used ([8|) and ((JJ) . 

Therefore, by considering k phases each one lasting t* time steps, since the bound in @ 
holds for every starting states of the Markov chain, we have that the probability that the two 
chains have not yet coupled after kt* time steps is 

k 

k/2m" 
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which is less than 1/4, for k = 0(m n ). By applying the Coupling Theorem (see Theorem I2.ip 
we have that t m i x = O (m n n log n). □ 

Remark. We can write the result of the previous theorem by saying that if a game Q has 
dominant strategies then the mixing time of its logit dynamics is O(l), where the asymptotic 
is on (3. 

We next prove that, for every m ^ 2, there are n-player games with to strategies per player 
whose logit dynamics mixing time is O (to" -1 ). Thus the m n factor in the upper bound given 
by Theorem 14.21 cannot be essentially improved. 

Theorem 4.3. For every to ^ 2 and n ^ 2, there exists a n-player potential game with dominant 
strategies where each player has to strategies and such that, for sufficiently large /3, the mixing 
time of the Markov chain of the logit dynamics with inverse noise f3 is t mix = $7 (?n n_1 ). 

Proof. Consider the game with n players, each of them having strategies {0, . . . ,to — 1}, such 
that for every player i: 

jo, ifx = 0; 
1—1, otherwise. 



Ui x 



Note that is a dominant strategy. This is a potential game with potential <l?(x) = — Uj(x) and 
thus the stationary distribution is given by the Gibbs measure in Q. We apply the bottleneck 
ratio technique (see Theorem 12. 7p with R = {0. . . , m — l} n \ {0}, for which we have 

with Z = 1 + e _/3 (m n - 1). It is easy to see that tt(R) < 1/2 for (3 > log(m n - 1) and 
furthermore 



Q(R,R) = ^vr(x)P(x,0) 



x6_R. 

e~P ^ e~P ^ 

^^P(x,0) = ^^P(x,0), 

xG_R xG-Ri 



where R± is the subset of R containing all states with exactly one non-zero entry. Notice that, 
for every x £ R±, we have 

P(x,0) = ± 



n 1 + (to — l)e ^ 
As |-Ri| = n{m — 1), we have 

_ e~P 1 1 
Q(R,R) = —\Ri 



Z n 1 + (to — l)e 
to — 1 



whence 



Z 1 + to - l)e- 



' 4 Q{R,R) 

>i.( m --i).l±6!Lzi)^>i.2^1. □ 

4 to — 1 4 to — 1 
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Max-solvable games. We note that, by using the same techniques exploited in this section, 
it is possible to prove an upper bound independent of /3 for max-solvable games [17], a class 
which contains games with dominant strategies as a special case, albeit with a function that is 
much larger than 0(m n n log n). We omit further details. 

5 Graphical coordination games 

In this section we consider 2x2 coordination games played on a social graph. In a coordination 
game each player has two strategies, and 1, and the payoff are such that the players have 
an advantage in selecting the same strategy. These games are often used to model the spread 
of a new technology j!8j : the two strategies correspond to adopting or not adopting a new 
technology. The game is formally described by the following payoff matrix 



1 



a, a 


c, d 


d, c 


6,6 



For convenience, we set 

So '■= a — d and 5\ := b — c. (11) 

We assume that 5q > and 5± > which implies that each player has an incentive in selecting 
the same strategy as the other player. The game has thus two pure Nash equilibria: (0, 0) and 
(1,1). If 5o > 5i we say that (0,0) is the risk dominant equilibrium; if 5q < S\ we say that 
(1, 1) is the risk dominant equilibrium; otherwise, we say that the game has no risk dominant 
equilibrium. The risk dominant equilibrium concept is a refinement of the concept of Nash 
equilibrium proposed by Harsanyi and Selten [TO]. It is easy to see that a coordination game is 
a potential game with potential function defined by 

0(0,0) = -6 , 0(1,1) = -<5i, 0(0,1) =0(1,0) =0. 

A graphical coordination game is a game in which n players are connected by a social graph 
G = (V, E) and every player plays an instance of the basic coordination game described by 
(fTOj) with each of her neighbors. Specifically, a player selects one strategy that is played against 
each of her neighbors and the payoff is the sum of the payoffs of the instances of the basic 
coordination game played. It is easy to see that the profiles where all players play the same 
strategy are Nash equilibria; moreover, if (x, x) is risk dominant for the basic coordination game, 
then profile (x, . . . ,x) is risk dominant profile for the graphical coordination game. 

We note that graphical coordination games are potential games where the potential function 
<I> is the sum of the edge potentials <£ e . For an edge e = (u, v), the edge potential $ e is defined 
as $ e (x) = (p(x u ,x v ) for edge e = (u,v). 

In Section 15.11 we will show a upper bound on the mixing time of the logit dynamics with 
inverse noise /3 for graphical coordination games that holds for every social graph. Our proof 
is a generalization of the the work of Berger et al. [5] on the Ising model. Then, we focus on 
two of the most studied topologies: the clique (Section 15 . 2[> . where the mixing time dependence 
on e /3A *( 1+ °( 1 * ) ) showed in Theorem 13.41 cannot be improved, and the ring (Section 15 .3[) . where 
a more local interaction implies a faster convergence to the stationary distribution. 

In the rest of this section we will assume without loss of generality that 5o ^ 5\ . 

5.1 Graphical coordination games on arbitrary graphs 

In this subsection we give a general bound to the mixing time of Markov chain of the logit 
dynamics with inverse noise (3 for graphical coordination games that depends on the cutwidth 
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of the social graph G. For an ordering £ of the vertices of a graph G, we write "j <g h" to 
denote that vertex j precedes vertex h in the ordering £ and define, for every vertex i, 

Ef = {(j,h) €E\j ^ii<th}. (12) 

Finally, the cutwidth x(fy of ordering £ is xi$) = max i and the cutwidth of G is 

X (G)=min X (i), (13) 

where C is the set of all the orderings on the vertices in V. 
The following theorem is the main result of this subsection. 

Theorem 5.1. Let Q be an n-player graphical coordination game on social graph G. The mixing 
time of the Markov chain of the logit dynamics with inverse noise (3 for Q is 

^mix < 2n 3 e* (G)(<5o+<5l)/3 (n<5 /3 + 1) . 

The proof is based on the Canonical Paths method (see Theorem 12. 6p . Fix an ordering £ 
of the vertices {1, 2, . . . , n} of G and rename the vertices so that 1 <£ 2 <£ ■ ■ ■ <£ n. We next 
define a set of paths Y = {T^ y }, one for every pair (x, y) of profiles, and relate the congestion 
of to the cutwidth of £. Specifically, the path r^ y from x to y relative to ordering I is defined 
as follows. Denote by i\ . . . , id, with < d ^ n, the components in which x and y differ ordered 
according to £. That is, for j = 1, . . . , d, we have xi j ^ y^ and i\ <£ i% <£ ■ ■ ■ <e id- Then we 
set r^y = (z°, z 1 , • • • , z d ), where z° = x, z d = y and, for < j < d, 

= (yi,'" ,yij,x ij+ i,- ■ ■ ,x n ). 

Notice that, since x and y coincide in all components from ij + 1 to ij+x — 1, we can also write 
z J as 

We will also find convenient to define, for each edge (u, v) of r^ y , ff v \(x, y) as follows 
f t , s. J ,Xi-l,Vi, ■■■ ,yn), if vr(u) ^ vr(v) ; 

/( U ,v)( x >y) = < f v , , ^ , , ( 14 ) 

{{xi, ■ ■ ■ ,Xi,y i+ i, ■ ■ ■ ,y n ), if vr(u) > vr(v) , 

where % denotes the component in which u and v differ. 

The following two technical lemmas establish two useful properties of the paths in V and 
of the function f e that will be useful to prove Theorem 15.11 

Lemma 5.2. For every pair of profiles x, y G S, for every ordering £ of the vertices of G and 
for every edge (u, v) € r£ y , 

mm{7r(u), 7r(vJ) 

where z = ff v ^ (x, y) and i is the component in which profiles u and v differ. 

Proof. We only prove the lemma for the case 7r(u) ^ tt(v), the proof for the other case be- 
ing similar. Since (u, v) is an edge of T*^ y then by definition of the path we have that 
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u = (yi,--- ,yi-i,Xi,--- ,x n ). Also, by the definition of the function f^ uv ^ we have z 
(xi, • • • , Xi-i, yi, ■ ■ ■ , y n ). Now observe that 

7r(x) • 7r(y) 7r(x) • 7r(y) 



min{-7r(u), 7r(v)} • vr(z) tt(u) • 7r(z 

-/3(*(x)+3 

-/*E £6C (^(x)+* £ (y)-* £ (u)^ e (z)) 



, /3(*(x)+$(y)-$(u)-*(z)) (16) 



Notice that for every edge e = (a, 6) of the social graph G, the potential & e depends only on 
components a and 6 of the profile. It is thus sufficient to prove that, for every edge e = (a, b) of 
the social graph G, 



® e (u a ,U b ) + $ e (>a, Z h ) - $ e {Xa,Xb) ~ $e(Z/a, 2/6 ) 



= 0, if e i Ei\ 

Oi + 5 , if e G Ei. 



If e ^ £7j then either a,b ^£ i or a, 6 >^ i. In the first case, we have that u a = y a and u b = yb 
and z a = x a and z b = x b . Hence 

$e(x a , Sfc) = $e(z a , Z b ) and $ e (f/a, 2/6 ) = $e(u a , Ub) ■ 

In the second case, we have that u a = x a and Ub = Xb and z a = y a and Zb = yb- Hence 

$ e (x a , X b ) = ® e (u a ,U b ) and $e(y a , yb) = $e(z a , Z b ) . 

Let us consider the case e £ Ei and assume without loss of generality that a <£ i <£ b. Therefore 
we have 

z a = x a , z b = y b and u a = y a , u b = x b 
and thus we consider quantity 

M = ® e (y a , Xb) + $ e (Xa,yb) ~ $e(x a , X b ) ~ $ e (j/a, 2/6 ) 

and prove that M ^ 5\ + 5q. If both addends are equal to —5q then we have x a = x b = and 
Ua = 2/6 — and thus M = 0. In the remaining cases, at most one of the two negative addends 
is equal to — 5q and the other is either of —Si and thus the claim holds. □ 

Lemma 5.3. For every edge e = (u, v) and for every ordering I of the vertices, the function 
fe is injective. 

Proof. Let z = /e( x ! y) De a profile in the co-domain of /J. We show that x and y are uniquely 
determined by z and e. Assume, without loss of generality, that tt(u) ^ tt(v) and suppose that 
u and v differ in the i-th component. Then (x±, . . . , = (z±, . . . , Zi-i) and (yi, . . . , y n ) = 
(zi, . . . , z n ). Moreover, since e belongs to the path r£ iy then v = (yi, . . . , 2/i-i, x^, . . . , x n ) and 
thus the remaining components of x and y are also determined. □ 

The following lemma gives an upper bound to the congestion p(T ) of the set of paths T l 
relative to order t. 

Lemma 5.4. 

p(T e ) ^ 2n 2 e^ 5 ° +s ^. (17) 
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Proof. Since 



we can state that 



Thus, 



P(u,v) = - 



■WW 



-/3min{0(u),</>(v)} 



n e -W(v) + e -/30(u) - 

min{-7r(u), vr(v)} 



Q(u,v) ^ 



2ra 



max 

.M — edge (u,v) 



(18) 



1 



V 



Q(u,v) 



(by EqUHland |r^ y | ^ n) 



(by Lemma [53 and |Sf| sC x(^)Vi) 



^ max 

M— edge (u,v) 



\ 



x,yi 



x,y: 
(u,v)er* y 



2n A 



£ 

x,y : 

V Hefi 



tt(x) • ?r(y) 
min{-7r(u), vr(v)} 



^ max 

Al— edge e 



max 

.M—edgc e 



/ 
\ 



2r? £ 



x,y: 
eer x,y 



7T /i(x,y; 



2n z e 



2 /3(<5 +5i)xM 



£ ^(/eW)) 



x,y: eer* 



(by Lemma 
Proof of Theorem I5.il By Lemma 



< 2n 2 e^W( 5o + 51 ). □ 
] we have that there exists a set of paths T with 
p(T) ^ 2n 2 e^ So+8 ^ 
and by Theorem 12.61 and Theorem I3.1|. we obtain that the relaxation time of A4@ 

if el < 2n 2 e^ So+s ^ . 
The theorem then follows by Theorem 12.31 and by observing that 

^min < Z 3 ^ + lo S \S\ < + log2 n ^ n(f3n5 + 1) . □ 

5.2 Graphical coordination games on a clique 

We now focus on graphical coordination games played on a clique, a very popular network 
topology where every player plays the basic coordination game in (jlOp with every other player. 
Since the cutwidth of a clique is 0(n 2 ), Theorem 15.11 gives us an upper bound to the mixing 
time of the Markov chain of the logit dynamics with inverse noise (3 for this class of games that 
is exponential in n 2 (5o + 5\)j3. 

However, since the game is a potential game we can obtain almost tight bounds by using 
Theorems 13.81 and 13.91 Indeed, it is easy to see that the potential depends only on the 
number of players playing strategy 1 and not on their position on the social network: if there 
are k players playing 1, then there will be k(k — l)/2 edges in the network in which both 
endpoints are playing 1, each one contributing —Si to the potential, and (n — k)(n — k — l)/2 
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edges whose endpoints are playing and hence contributing — 5q. Hence, for a profile x with k 
players playing 0, the potential «J?(x) = — fc )(^ k l ) g Q _(. g l j 

By simple algebraic analysis, it turns out that the maximum of the potential $ max is attained 
when k* players are playing strategy 1, where k* is the integer closest to (n — 1) s ^ + | , and 

<&(x) monotonically increases as w(x) goes from n to fc* and then monotonically decreases as 
w(x) goes from k* to 0. Hence, it turns out that for every pair of profiles x, y with vr(x) ^ vr(y) 
and any path 7 between x and y, the maximum increase of the potential function along 7 is 

m \ ^ J $ max - $(x) if w(x) < k* and w(y) > fe* or viceversa; 

^ 1 */ \ */ \ 4.1. ■ ( 19 ) 
I *(y) ~~ ^(x) otherwise. 

Moreover, for any pair of profiles there exist at least a path (e.g., the bit-fixing path) for which 
(|19|) holds with equality. Thus, for the graphical coordination game on a clique, since Sq ^ Si, 
we have $(0) ^ 3>(1) and hence ( = $ max — <3?(1). Then, by Theorems 13.81 and 13.91 we can 
obtain the following bound to the mixing time of the Markov chain of the logit dynamics with 
inverse noise f3. 

Theorem 5.5. For every graphical coordination game on a clique the mixing time of the Markov 
chain of the logit dynamics with inverse noise /3 is 

(7/3(*max-*(l)) <; ^ <; £)/3($max-*(l))<5l 

where C,D = Op(l). 

Note that <& ma x — ^(1) depends on the ratio g^g 1 , with the worst case being when no risk 
dominant strategy exists (i.e., 5q = Si): in this case s S + Sl = \ and $ max — ^(1) = &(n 2 5i). 

5.3 Graphical coordination games on a ring 

In this section we give upper and lower bounds on the mixing time for graphical coordination 
games played on a ring for basic coordination games that admit no risk dominant strategy 
(<5o = Si = 5). Unlike the clique, the ring encodes a very local type of interaction between the 
players which is more likely to occur in a social context. We show, for every /3, an upper bound 
that is exponential in 25/3 and polynomial in ralogn. This improves on the bounds that can 
be derived by applying Theorem 15.11 or Theorem 13.81 Indeed, since the cutwidth of the ring is 
2, Theorem 15.11 would give an upper bound exponential in AS/3 and polynomial in n 4 S(3. Also, 
Theorem 13.81 only holds for sufficiently large f3. 

The proof of the upper bound uses the path coupling technique (see Theorem 12. 2p and can 
be seen as a generalization of the upper bound on the mixing time for the Ising model on the 
ring (see Theorem 15.4 in |12j). 



Theorem 5.6. Let Q be an n-player graphical coordination game played on a ring with no risk- 
dominant strategy (Sq = Si =5). The mixing time of the Markov chain of the logit dynamics 
with inverse noise f3 for Q is 

tL v = o(e 25 Pn\oen, 



mix 



Proof. We identify the n players with the integers in {0, . . . , n — 1} and assume that every player 
i plays the basic coordination game with her two adjacent players, (i — 1) mod n and (i + 1) 
mod n. The set of profile is S = {0, l} n . 

Let us consider two adjacent profiles x and y differing in the j-th component and, without 
loss of generality, assume that Xj = 1 and yj = 0. We consider the following coupling for two 
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chains X and Y starting respectively from Xq = x and Yq = y: pick i € {0, . . . ,n — 1} and 
U € [0, 1] independently and uniformly at random and update position i of x and y by setting 



^ JO, if [/ < <7i(0 | x); fO, if ?7<<7i(0|y); 

Xi \l, if U>*i(0 |x); ^ \l, if £/ > (7/(0 | y). 

We next compute the expected distance between X\ and Yi, the states of the two chains after 
one step of the coupling. Notice that crj(0 | x) only depends on and xi + \ and <7j(0 | y) only 
depends on yi-\ and y-i+i- Therefore, since x and y only differ at position j, o"j(0 | x) = o"j(0 | y) 
for i ^ j - l,j + 1. 

We start by observing that if position j is chosen for update (this happens with probability 
1/n) then, by the observation above, both chains perform the same update. Since x and y differ 
only for player j, we have that the two chains are coupled (and thus at distance 0). Similarly, if 
i j — 1, j,j + 1 (which happens with probability (n — 3)/n) we have that both chains perform 
the same update and thus remain at distance 1. Finally, let us consider the case in which 
i 6 { j — l,j + 1}. In this case, since Xj = 1 and yj = 0, we have that c"j(0 | x) ^ <r.;(0 | y). 
Therefore, with probability <7j(0 | x) both chains update position i to and thus remain at 
distance 1; with probability 1 — <7j(0 | y) both chains update position i to 1 and thus remain at 
distance 1; and with probability o"j(0 | y) — <7j(0 | x) chain X updates position i to 1 and chain 
Y updates position i to and thus the two chains go to distance 2. By summing up, we have 
that the expected distance E[p(Xi,Yi)] after one step of coupling of the two chains is 

EHXuY^} 



n 




3 




1 
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n 
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+ 
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n 






n 



J2 (i + ^(o|y)-^(o[x)) 

ie{j-i,j+i} 

]T (a i (0\y)-a i (0\x)). 
ie{i-i,i+i} 

Let us now evaluate the difference (7,(0 | y) — <Tj(0 | x) for i = j — 1 (the same computation 
holds for i = j + 1). We distinguish two cases depending on the strategies of player j — 2 and 
start with the case Xj-2 = Vj-2 = 1- In this case we have that 

0j-i(O I x) = 1 + 1 e2(5/3 and ct.,_i(0 | y) = - . 

Thus, 

. . . . . . 1 1 

o-;-i(0 | y) - Oj-_i(0 I x) = - - 1 + e2g/3 - 

If instead £j-2 = 2/j-2 — 0, we have 

0-i-i(o I x ) = ^ and o-j-i(o I y) = 1 + ■ 

Thus 

o-j-i(o I y) - ^-i(o|x) = - - e _ 28f3 - - 
i 11 i 

= i 



l + e 25 /3 2 2 l + e 2 ^ 
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We can conclude that the expected distance after one step of the chain is 

E[p(X 1 ,Y 1 )] = ^l + i(l 
n n \ 



1 + e 2 ^ 



1 



n (l + e 2«5/3) 

2 

< g „(l+ e 2«/3) _ 

Since the diameter of G is diam(G) = re, by applying Theorem [22] with a = n M+ e 25p\ > we obtain 
the theorem. □ 

The upper bound in Theorem 15.61 is nearly tight (up to the nlogn factor). Indeed, a lower 
bound can be obtained by applying the Bottleneck Ratio technique (see Theorem I2.7P to the 
set R = {1}. Notice that n(R) ^ \ since profile has the same potential as 1. Thus set R 
satisfies the hypothesis of Theorem 12.71 Simple computations show that the bottleneck ratio is 



B(R) = J £ p ( 1 >y) = TT^- 



Thus, by applying Theorem 12.71 we obtain the following bound. 

Theorem 5.7. Let Q be a n-player graphical coordination game on a ring with no risk- dominant 
strategy. The mixing time of the logit dynamics for Q is t m { x = Q (l + e 2<5 ^) . 



6 Conclusions and open problems 

In this chapter we give different bounds on the mixing time of the logit dynamics for the class of 
potential games: we showed that the mixing time is fast when (3 is small enough and we found 
the structural property of the game that characterizes the mixing time for large /?; finally we 
showed a bound that holds for every value of /3. Unfortunately, the last bound does not match 
the previous ones in every game: thus it would be interesting to have an unique bound that 
holds for every values of /3 and matches with bounds given in Theorems 13.61 and 13.81 

On the other hand, we show that there exists a class of games, namely dominant strategy 
games, such that the mixing time of the logit dynamics does not grow indefinitely with the 
inverse noise. However, it would be interesting to characterize the class of games for which this 
holds: a natural candidate for this class may be the set of potential games where £* = 0. 

Finally, we consider coordination games on different topologies of graphs, where we give 
evidence that the mixing time is affected by structural properties as the connectedness of the 
network. 

The main goal of this line of research is to give general bounds on the logit dynamics mixing 
time for any game, highlighting the features of the game that distinguish between polynomial 
and exponential mixing time. We stress that, when the game is not a potential game, in general 
there is not a simple closed form for the stationary distribution like Equation @. 

At every step of the logit dynamics one single player is selected to update her strategy. It 
would be interesting to consider variations of such dynamics where players are allowed to update 
their strategies simultaneously. The special case of parallel best response (that is j3 = oo) has 
been studied in [T7]. Another interesting variant of the logit dynamics is the one in which the 
value of (3 is not fixed, but varies according to some learning process by which players acquire 
more information on the game as time progresses. 
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When the mixing time of the logit dynamics is polynomial, we know that the stationary 
distribution gives good predictions of the state of the system after a polynomial number of time 
steps. When the mixing time is exponential, it would be interesting to analyze the transient 
phase of the logit dynamics, in order to investigate what kind of predictions can be made about 
the state of the system in such a phase. Preliminary results along this direction has been 
achieved in [2]. 
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